Aligning Eyes between Humans and Deep Neural Network through Interactive Attention Alignment
نویسندگان
چکیده
While Deep Neural Networks (DNNs) are deriving the major innovations through their powerful automation, we also witnessing peril behind automation as a form of bias, such automated racism, gender and adversarial bias. As societal impact DNNs grows, finding an effective way to steer align behavior with human mental model has become indispensable in realizing fair accountable models. establishing adjust "think like humans'' is pressing need, there have been few approaches aiming capture how "humans would think'' when introduce biased reasoning seeing new instance. We propose Interactive Attention Alignment (IAA), framework that uses methods for visualizing attention, saliency maps, interactive medium humans can leverage unveil cases DNN's directly attention. To realize more human-steerable than state-of-the-art, IAA introduces two novel devices. First, Reasonability Matrix systematically identify Second, applies GRADIA, computational pipeline designed effectively applying adjusted attention jointly maximize quality prediction accuracy. evaluated Study 1 GRADIA 2 classification problem. In 1, found bias detection significantly improve perceived from eyes not Matrix. 2, using improves (1) human-assessed (2) performance scenarios where training samples limited. Based on our observation studies, present implications future design problem space social computing data annotation toward achieving human-centered steerable AI.
منابع مشابه
Anomaly-based Web Attack Detection: The Application of Deep Neural Network Seq2Seq With Attention Mechanism
Today, the use of the Internet and Internet sites has been an integrated part of the people’s lives, and most activities and important data are in the Internet websites. Thus, attempts to intrude into these websites have grown exponentially. Intrusion detection systems (IDS) of web attacks are an approach to protect users. But, these systems are suffering from such drawbacks as low accuracy in ...
متن کاملHyperparameter Transfer Learning through Surrogate Alignment for Efficient Deep Neural Network Training
Recently, several optimization methods have been successfully applied to the hyperparameter optimization of deep neural networks (DNNs). The methods work by modeling the joint distribution of hyperparameter values and corresponding error. Those methods become less practical when applied to modern DNNs whose training may take a few days and thus one cannot collect sufficient observations to accu...
متن کاملWord Alignment Modeling with Context Dependent Deep Neural Network
In this paper, we explore a novel bilingual word alignment approach based on DNN (Deep Neural Network), which has been proven to be very effective in various machine learning tasks (Collobert et al., 2011). We describe in detail how we adapt and extend the CD-DNNHMM (Dahl et al., 2012) method introduced in speech recognition to the HMMbased word alignment model, in which bilingual word embeddin...
متن کاملNeural Network-based Word Alignment through Score Aggregation
We present a simple neural network for word alignment that builds source and target word window representations to compute alignment scores for sentence pairs. To enable unsupervised training, we use an aggregation operation that summarizes the alignment scores for a given target word. A soft-margin objective increases scores for true target words while decreasing scores for target words that a...
متن کاملDeep Attention Recurrent Q-Network
A deep learning approach to reinforcement learning led to a general learner able to train on visual input to play a variety of arcade games at the human and superhuman levels. Its creators at the Google DeepMind’s team called the approach: Deep Q-Network (DQN). We present an extension of DQN by “soft” and “hard” attention mechanisms. Tests of the proposed Deep Attention Recurrent Q-Network (DAR...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ACM on human-computer interaction
سال: 2022
ISSN: ['2573-0142']
DOI: https://doi.org/10.1145/3555590